Picture for Cheng Yang

Cheng Yang

Do Text Edits Generalize to Visual Generation? Benchmarking Cross-Modal Knowledge Editing in UMMs

Add code
May 30, 2026
Viaarxiv icon

SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Add code
May 29, 2026
Viaarxiv icon

ParaTool: Shifting Tool Representations from Context to Parameters

Add code
May 28, 2026
Viaarxiv icon

RelPrism: A Multi-Faceted Pre-training Framework with Self-Generated Tasks for Relational Databases

Add code
May 22, 2026
Viaarxiv icon

Latent Action Reparameterization for Efficient Agent Inference

Add code
May 19, 2026
Viaarxiv icon

RS-Claw: Progressive Active Tool Exploration via Hierarchical Skill Trees for Remote Sensing Agents

Add code
May 13, 2026
Viaarxiv icon

Uno-Orchestra: Parsimonious Agent Routing via Selective Delegation

Add code
May 06, 2026
Viaarxiv icon

From Experience to Skill: Multi-Agent Generative Engine Optimization via Reusable Strategy Learning

Add code
Apr 21, 2026
Viaarxiv icon

GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis

Add code
Apr 15, 2026
Viaarxiv icon

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Add code
Apr 02, 2026
Viaarxiv icon